REMARKS 

The above amendments have been made to place the application in better form for 
examination. The attachment to this Preliminary Amendment entitled "Version with Markings 
to Show Changes Made" is a marked-up version of the changes made to the specification and 
claims. Applicant hereby requests an action on the merits at the earliest opportunity. 

VERSION WITH MARKINGS TO SHOW CHANGES MADE 



IN THE SPECIFICATION: 

Page 1, line 12, the paragraph is amended as follows: 

[1] S. Lawrence [L. Giles] and C.L. Giles [S. Lawrence], "Accessibility [and 
Distribution] of Information on the Web," Nature, vol. 400, July 8, 1999. 

Page 1, line 18, the paragraph is amended as follows: 

[4] S. Brin, J. Davis, and H. Garcia-Molina, "Copy Detection Mechanisms for Digital 
Documents," Proceedings of the ACM SIGMOD Annual Conference['95], May 1995. 

Page 1, line 22, the paragraph is amended as follows: 

[6] N. Sh[r]ivakumar and H. Garcia-Molina, "SCAM: A Copy Detection Mechanism for 
Digital Documents," Proceedings of the Second International Conference in Theory and Practice 
of Digital Libraries, June 1995. 

Page 1, line 25, the paragraph is amended as follows: 

[7] N. Sh[r]ivakumar and H. Garcia-Molina, "Building a Scalable and Accurate Copy 
Detection Mechanism," Proceedings of Third International Conference on Theory and Practice 
of Digital Libraries, March 1996. 

Page 2, line 4, the paragraph is amended as follows: 

[10] V. Chalana, A. Bruce, and T. Nguyen, "Duplicate Document Detection in 
DocBrowse [Mathsoft Data Analysis Products Division: DocBrowser]," 
www.statsci.com/docbrowse/paper/spie98/node1.htm, July 31, 1999. 

Page 2, line 6, the paragraph is amended as follows: 

[11] G. Salton, A. Wong [C.S. Yang], and C.S. Yang [A. Wong], "A Vector[-] Space 
Model for Automatic Indexing," Comm. Of the ACM, vol. 18, no. 11, pp. 613-620, November 
1975. 

Page 2, line 8, the paragraph is amended as follows: 

[12] M.F. Porter, "An Algorithm for Suffix Stripping," Program, vol. 14, no. 3, 
pp. 130-137, July 1980. 

Page 2, line 13, the paragraph is amended as follows: 

[14] R. S. Scotti and C. Lilly, "Analysis and Design of Test Corpora for Zero-Tolerance 
Government Document Review Process," Symposium for Document Image Understanding 
Technology, Annapolis, Maryland, April, 1999, also reported at George Washington University 
Declassification Productivity Research Center, http://dprec.seas.gwu.edu, July 31,] 1999. 

Page 2, line 15, the paragraph is amended as follows: 

[15] D. Grossman, D. Holmes, and O. Frieder, "A Parallel DBMS Approach to IR in 
TREC-3[4]", Overview of the Third [Fourth] Text Retrieval Conference (TREC-3[4]), November 
1994 [1995]. 